Junlan Feng

China Mobile Research Institute, Beijing, China

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Jun 01, 2026
Via arXiv

ChildEval: When large language models meet children's personalities

May 27, 2026
Via arXiv

JT-SAFE-V2: Safety-by-Design Foundation Model with World-Context Data

May 23, 2026
Via arXiv

Strategy-Aware Optimization Modeling with Reasoning LLMs

May 04, 2026
Via arXiv

SeaEvo: Advancing Algorithm Discovery with Strategy Space Evolution

Apr 27, 2026
Via arXiv

DR$^{3}$-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Apr 16, 2026
Via arXiv

GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning

Mar 24, 2026
Via arXiv

CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases

Mar 09, 2026
Via arXiv

Beyond One-Size-Fits-All: Adaptive Subgraph Denoising for Zero-Shot Graph Learning with Large Language Models

Mar 03, 2026
Via arXiv

B-GRPO: Unsupervised Speech Emotion Recognition based on Batched-Group Relative Policy Optimization

Feb 06, 2026
Via arXiv